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Abstract 


This  paper  provides  users  of  factor  analysis  with  an  application 
oriented  framework  for  choosing  an  appropriate  rotation. 


A  Usage  Based  Framework  for  Choosing 
Appropriate  Rotation  in  Factor  Analysis 


Introduction 


Factor  analysis  plays  an  important  role  in  marketing  research.   It 
is  widely  used  both  by  academicians  and  practitioners.   While  the 
basic  models  used  in  factor  analysis  are  well  grounded  in  mathematical 
theories  of  linear  algebra  and  statistics,  considerable  maturity  and 
judgment  is  required  in  interpreting  the  results  of  a  factor  analysis 
"solution."   For  example,  Dielman,  Cattell,  and  Wagner  (1972)  intro- 
duce their  paper  saying,  "It  is  not  unusual  for  the  type  in  factor 
analysis  to  consider  rotation  as  just  one  of  half-a-dozen  technical 
steps  in  the  whole  analysis.   No  greater  mistake  could  be  made,  for, 
as  the  experienced  researcher  realizes,  issues  of  considerable  moment, 
such  as  the  theoretical  alternatives  in  Cattell's  and  Eysenck's  struc- 
turing of  neuroticism  and  extraversion,  hinge  alternatively  on  rota- 
tional resolutions  (p.  223). 

The  typical  flow  of  decisions  to  be  made  in  the  use  of  factor 
analysis  is: 

•  Technique  of  factoring 

•  Number  of  factors  to  be  used 

•  Choice  of  rotation 

•  Interpretation  and  usage  of  information  from  the  analysis. 
Considerable  considerable  guidance  is  available  for  the  choice  of 

techniques  and  in  deciding  upon  the  number  of  factors  to  retain.   How- 
ever, the  choice  of  which  rotation  technique  to  use  has  received  less 
attention.   Furthermore,  most  of  the  advice  is  technical  in  nature. 
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What  we  need  is  an  understanding  of  which  rotation  is  appropriate 
depending  upon  the  specific  use  of  the  researcher.   The  purpose  of 
this  paper  is  to  provide  a  framework  of  choosing  a  specific  factor 
rotation  based  on  a  usage  framework. 

Why  A  Particular  Rotation? 

As  Strang  puts  it,  "for  any  orthogonal  matrix  of  order  p,  the 
factor  matrix  F  =  FQ  accounts  for  exactly  the  same  correlations 
FF   =  FQQ  F   =  FF   that  F  does.   Therefore,  F  and  F  are  completely 
interchangeable.   In  a  typical  problem,  the  original  factors  may  have 
substantial  loadings  on  dozens  of  variables,  and  such  a  factor  is 
practically  impossible  to  interpret.   It  has  mathematical  meaning  as  a 
vector  of  f  ,  but  no  useful  meaning  to  a  social  scientist.   Therefore 
he  tries  to  choose  a  rotation...  ."  (Strang,  1976,  p.  216). 

For  marketing  researchers,  the  hard  work  really  begins.   Inter- 
pretation is  the  key  to  meaningful  use  of  factor  analysis.   And,  a  key 
to  meaningful  interpretation  is  the  perspective  or  rotation  brought  to 
bear  upon  the  factor  solution. 

In  an  applied  field  such  as  market  research,  interpretation  of  the 
data  analysis  is  a  key  to  the  meaningful  use  of  factor  analysis.   The 
hard  work  really  begins  after  the  factors  have  been  extracted  and  the 
market  researcher  must  decide  on  which  rotation  technique  to  use. 
Unfortunately,  the  lack  of  determinancy  and  choice  of  infinite  analytic 
solutions  creates  a  problem  of  ensuring  the  right  choice  which  is 
based  on  nonstatistical  criteria. 
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Let  us  consider  an  example  to  illustrate  the  importance  played  by 
rotation  in  interpretation.   In  an  analysis  of  some  census  tract  in- 
formation (the  variables  being  total  population,  median  school  years, 
total  employment,  miscellaneous  professional  services,  and  value  of 
median  value  house)  Harman  (1967)  obtained  a  two-factor  solution. 
The  factor  loadings  he  obtained  are  shown  in  Exhibit  1.   As  shown  in 
the  inset  to  Exhibit  1,  the  two-factor  solution  explain  93.5  percent 
of  the  variance  explained  by  the  original  five  variables.   But  it  is 
difficult  to  interpret  the  two-factor  solution.   All  the  variables, 
except  variable  4,  load  moderately  or  very  heavily  on  both  the  fac- 
tors.  If  our  interest  is  in  drawing  a  rich  meaning  from  the  factors, 
we  would  seek  to  achieve  Thurstone's  (1947)  simple  structure  to  at- 
tain which  rotation  is  needed.   A  simple  rotation  that  is  visually 
quite  obvious,  in  this  simple  two  factor,  five  variables  example,  is 
shown  in  Exhibit  2.   Such  a  rotation  leads  to  the  factor  loadings 
matrix  shown  in  the  inset  to  Exhibit  2.   It  is  very  clear  both  from 
the  graph  as  well  as  from  the  loadings  matrix  that  factor  1  (Ml)  is  a 
clean  composite  of  variables  2  and  5.   Factor  2  (M2)  is  loaded  upon 
heavily  by  variables  1  and  3.   Variable  4  loads  heavily  on  factor  1 
and  moderately  on  factor  2,  leading  to  some  possible  ambiguity. 
Factor  Ml  can  be  interpreted  (as  in  Harman)  as  a  factor  explaining 
aspects  of  overall  size  of  the  tract.   Factor  M2  appears  to  explain 
aspects  of  the  quality  (or  attainment  level)  of  a  census  tract.   The 
value  of  factor  rotation  in  correctly  interpreting  a  factor  analysis 
solution  is  thus  immeasurable. 
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Principal-component  solution 
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EXAMPLE  WITHOUT  ROTATION 
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EXHIBIT   2:      WITH   ROTATION 


Uses  for  Factor  Analysis  in  Marketing  and  their  Relationship 

to  Rotation 

There  are  four  major  types  of  uses  for  factor  analysis  in  market- 
ing research.   These  are: 

1.  To  provide  insights  into  the  more  fundamental  and  underlying 
factors  which  create  a  structure  of  systematic  relationships 
among  observed  phenomena.   For  example,  ascertaining  the 
factors  underlying  systematic  preferences  for  various  types 
of  liquors. 

2.  To  provide  a  typology  or  classification  of  a  large  number  of 
entities.   For  example,  development  of  a  typology  of  magazine 
readers  based  on  their  reading  of  over  50  popular  magazines. 

3.  To  bring  out  in  bold  relief  salient  relationships  so  that 
researchers  can  cull  the  relevant  variables  before  further 
analysis. 

In  the  task  of  building  better  forecasting  and  predictive  models, 
one  seeks  to  build  parsimonious  models.   Factor  analysis  provides  a 
way  of  selecting  the  relevant  predictor  variables  from  a  larger  set 
of  variables.   This  pruning  is  particularly  critical  when  given  only 
a  few  observations,  we  need  to  reduce  the  large  number  of  predictor 
variables  to  an  optimal  number  to  preserve  the  degrees  of  freedom. 

4.  To  transform  data  so  as  to  make  them  more  representative  of 
the  assumptions  of  powerful  predictive  models. 

Example  1:   To  transform  data  to  meet  the  conditions  of 
normality,  independence,  and  random  error  assumptions 
of  regression  analysis. 
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Example  2:   To  transform  available  data  to  meet  the 
conditions  of  independence  and  homogeneity  indiscrirainant 
analysis . 

Yet  another  possible  use  could  be  a  factor  analytic  solution  as 
part  of  a  higher  order  factor  analysis  (Rummel,  1970).   In  higher  order 
factor  analysis,  a  first  stage  solution  identifies  a  factor  structure 
(reduced  space)  that  explains  the  interrelationships  in  the  original 
data  (say  among  a  large  set  of  variables).   The  analyst  expects  or 
wishes  to  explore  the  possibility  that  an  even  more  parsimonious 
description  of  the  interrelationships  is  possible.   Thus,  he  would 
perform  a  higher  order  factor  analysis  of  a  factor  solution  to  obtain 
this  more  parsimonious,  and  higher  order  solution.   We  have  summarized 
this  discussion  in  Exhibit  3. 

Several  rotation  schemes  have  been  developed  and  incorporated  in 
the  numerous  statistical  analysis  packages  available  to  users.   These 
schemes  are  broadly  classified  into  (i)  oblique,  and  (ii)  orthogonal 
rotation  schemes.   Furthermore,  one  can  rotate  the  factor  structure  to 
simplify  and  therefore  get  better  understanding  of  the  (a)  variables  or 
the  (2)  underlying  factors. 

The  outcome  of  using  an  orthogonal  rotation  is  the  development  of 
a  reduced  factor  space,  where  the  factors  are  orthogonal  (or  "inde- 
pendent").  Oblique  rotations,  in  general,  lead  to  factors  that  are  not 
orthogonal  (i.e.,  the  axes  of  the  new  reduced  space  are  correlated). 

Further,  each  rotation  scheme,  as  mentioned  earlier,  has  a  dif- 
ferent objective  that  it  seeks  to  satisfy  optimally.   We  provide  a 
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brief  description  of  the  criterion  for  each  scheme  in  the  Appendix. 
These  criteria  lead  to  either  an  ease  of  interpretation  by  focusing  on 
the  emergent  structural  relationships  between  different  variables,  or 
by  focusing  on  the  interpretation  of  the  factors  that  underlie  the 
observed  (or  manifest)  variables. 

Based  on  the  criterion  that  is  used  in  each  of  the  rotation  schemes, 
we  have  classified  the  more  generally  available  and  popular  schemes  on 
the  basis  of  (i)  whether  they  perform  oblique  or  orthogonal  rotations, 
and  (ii)  whether  they  aid  interpretation  of  the  variables  or  the 
underlying  factors.   Exhibit  4  provides  this  classification. 

As  can  readily  be  seen  there  is  a  match  between  Exhibits  3  and  4. 
Thus,  we  have  identified  a  typology  of  possible  marketing  applications 
of  factor  analysis  that  will  allow  a  choice  of  the  appropriate  rotation. 
To  make  our  exposition  clearer,  and  the  framework  easy  to  use,  let  us 
consider  some  marketing  examples. 

Stoetzel  in  1960,  was  interested  in  identifying  the  dimensions 
underlying  preferences  for  liquor  among  French  consumers.   His  concern 
was  with  the  factors  (or  the  latent  dimensions)  based  on  which  prefer- 
ences were  developed  for  different  liquors.   His  concern  was  not  with 
specific  liquors  but  with  the  underlying  preference  dimensions  on  which 
all  types  of  liquors  are  preferred.   He  was  thus  interested  in  identi- 
fying the  latent  factors  and  the  relationships  among  these  factors. 
This  clearly  is  an  example  of  box  1  in  Exhibit  4.   A  correct  choice 
should  have  been  the  use  of  an  obliqued  rotation  which  focuses  on 
simplification  or  interpretation  of  factors.   This  includes  oblique 
Varimax  or  its  variations.   However,  Stoetzel  used  an  orthogonal 
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rotation.   Having  developed  a  three  factor  solution,  he  labeled  these 
factors  as  (1)  sweetness  strength,  (2)  low  price-high  price,  and 
(3)  regional  popularity.   There  could  be  an  interaction  between 
factors  1  and  3,  and  2  and  3.   An  oblique  rotation  would  bring  this 
out  in  a  clear  fashion.   An  orthogonal  rotation  on  the  other  hand 
would  force  factor  independence  and  obscure  any  such  relationships. 
Wells  and  Sheth  in  1974,  reported  on  a  study  typical  of  studies 
used  for  clustering  and  segmentation.   Their  primary  objective  was 
to  analyze  the  frequency  of  readership  of  30  magazines  and  identify 
a  classification  of  magazine  types.   This  was  to  further  enable  them 
to  draw  implications  regarding  the  commonality  of  audience  provided 
by  each  grouping  of  magazines.   Based  on  their  analysis  they  report 
groupings  of  magazines.   Some  of  these  are  (i)  Car  and  Driver,  Road  and 
Track.,  Motor  Trend  and  Hot  Rod,  (ii)  Fortune,  Forbes,  Time,  and 
Business  Week,  (iii)  Field  and  Stream,  Outdoor  Life,  and  Sports 
Afield,  and  so  on.   Their  focus  is  on  the  variables  (the  magazines 
for  which  they  obtained  readership  frequency  data.   By  allowing  an 
oblique  rotation,  they  could  have  permitted  a  more  natural  interpre- 
tation of  their  analysis.   By  performing  a  Varimax  (orthogonal)  rota- 
tion, they  precluded  the  possibility  of  magazines  appearing  in  more 
than  one  group.   For  example,  Reader's  Digest  does  not  have  high 
loadings  on  any  of  the  factors,  in  spite  of  attempting  to  force  it  to 
load  heavily  on  just  one  factor  (using  Varimax).   Its  positive  load- 
ings are  divided  among  a  "news  group"  (U.S.  News  and  Newsweek),  a 
"general  reading"  group  (Life,  Look,  Saturday  Evening  Post),  and  a 
"men's  fiction"  group  (Argosy  and  True).   But,  these  loadings  are 


extremely  likely  to  change,  if  our  objective  allows  the  spreading  of 
loadings  by  permitting  correlated  (or  oblique)  factors. 

For  cell  3,  the  study  by  Twedt  (1952)  of  advertising  readership 
is  a  good  example.   He  was  interested  in  identifying  the  determinants 
of  advertising  readership.   His  was  a  search  for  the  most  promising 
variables  that  correlated  with  advertising  readership.   He  obtained 
a  recall-based  readership  measure  on  122  advertisements  of  1/4  page 
or  more  in  size  from  a  single  issue  of  American  Builder,  a  trade 
magazine.   Based  on  judgment,  34  variables  describing  the  mechanical 
and  content  aspects  of  these  advertisements  were  chosen.   By  study- 
ing the  correlations  between  these  34  variables  and  his  readership 
variable,  he  chose  19  predictor  variables  (out  of  34).   A  factor 
analysis  of  these  19  variables  along  with  the  criterion  variable 
yielded  6  meaningful  factors.   These  factors  were  orthogonally 
rotated  before  interpretation.   He  wished  to  identify  the  variables 
that  were  the  most  relevant  as  predictor  variables,  and  could  be 
used  as  predictor  variables  in  a  multiple  regression  model.   So  his 
focus  was  on  the  variable  and  he  wished  to  obtain  independent  (or 
orthogonal)  variables  to  use  in  his  multiple  regression  model.   This 
model  resulted  in  a  multiple  of  0.75.   When  an  additional  six  vari- 
ables were  added  to  the  model,  the  multiple  R  increased  only  by  0.04 
which  suggests  the  adequacy  of  his  factor  analysis  solution  in 
identifying  the  relevant  variables  which  were  determinant  of  adver- 
tising readership. 

Deshpande  (1982),  provides  us  an  example  for  cell  4.  His  study 
was  aimed  at  understanding  the  organizational  context  of  market  re- 
search use.   He  wished  to  develop  a  multiple  regression  model  that 
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would  predict  research  utilization.   Factor  analysis  was  used  to 
develop  a  set  of  predictor  variables  for  this  model.   These  predictor 
variables  were  thus  to  be  uncorrelated  from  a  set  of  23  variables 
(that  were  concerned  with  measuring  different  elements  of  perceptions 
of  organizational  structure)  he  developed  a  smaller  set  of  indices 
that  would  be  the  latent  variables  giving  rise  to  the  manifest  var- 
iables.  These  indices  (or  factor  scores)  were  providing  the  meaning 
of  being  the  latent,  or  causal,  dimensions  of  organizational  struc- 
ture perceptions.   His  focus  was  clearly  on  identifying  and  modeling 
the  factor  structure.   His  factors  (or  indices)  were  to  be  orthogonal, 
and  thus  clearly  the  method  of  rotation  (Varimax)  chosen  by  him  follows 
from  our  framework  as  the  appropriate  choice. 

Conclusions 


Hair,  Anderson,  Tattan,  and  Grablowsky  (1979,  p.  230),  note  "No 
specific  rules  have  been  developed  to  guide  the  analyst  in  selecting  a 
particular  ...  rotational  technique."   We  urge  our  readers  not  to 
choose  a  rotation  scheme  just  because  Varimax  is  a  familiar  and  common 
scheme,  but  to  reason  through  the  raison  d'etere  of  their  analysis,  to 
arrive  at  the  appropriate  rotation  scheme  which  will  aid  in  better 
(and  more  correct)  substantive  interpretation.   Towards  this  end,  in 
this  paper,  we  have  provided  a  usage  based  choice  framework  explicitly 
for  this  purpose. 
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APPENDIX 
A  Technical  Summary  of  Major  Rotation  Schemes 


ORTHOGONAL  SCHEMES 


Scheme 

Obiective  Function 

Comments 

m    p 
Max  Q  =   E    ^      a   o 
3=1   1=1      j£ 

QUARTIMAX 

This  function  increases  the 

high,  and  decreases  the 

middle  loadings  for  a 

where,  ctj£  =  factor  loadi 

ng  of 

variable.   It  approximates 

variable  x.:  on  factors  Sjj 
m  =  number  of  variables, 

» 

simple  structure,  but  has 

a  bias  toward  loading  more 

p  =  number  of  factors. 

variance  on  the  first 
factor. 

VARIMAX 

P   m   V  4 
Max  V  =  m   E    Z      Or-^) 

Very  commonly  used  (some- 

times inappropriately) 

orthogonal  rotation. 

2 

-  z  ( z  -j±r 

1=1   3=1   h. 

J 

Approximates  simple  struc- 

ture. 

where,  v  =  variance  of 

normalized  factors,  h?  = 

munalitv  of  variable  x.. 

J 

conr 

OBLIMAa 


Max  K  - 
kurtosis 


m   p 
E    E 

.1=1  1=1 


OBLIQUE  SCHEMES 

4 

J* 


m    p 
(  E    E 


V 


where  a-j£  =  oblique  reference 
structure  loading. 


Attempts  to  increase  the 
number  of  high  and  low 
loadings  and  decrease 
those  in  the  middle.   Good 
to  use  only  if  data  have 
clean  simple  structure. 


APPENDIX  (continued) 


OBLIQUE  SCHEMES 


Scheme 

Objective  Function 

Comments 

QUART I MIN 

P    m   0   2 
Min  Q  =   Z          Z   a"  a. 
Ka=l  j  =  l   J   J^ 

Attempts  to  increase  the 

high  loadings  of  a  variable 

on  one  factor  and  decrease 

where  dj£  =  oblique  reference 

its  loadings  on  others. 

structure  loading. 

Results  oftentimes  in 
highly  intercorrelated 
(0.5,  often  >0.7)  factors. 

2 
P      m  a- » 
Min  C  =    S   {m  Z  (-4%(- 

Kq=l   j  =  l  h.   1 

2 

CCOVARIMIN 

-LI) 
h2 

Tends  to  produce  solutions 

very  close  to  orthogonal 

J 

J 

Varimax  solutions  (i.e., 

2         2 
m   a      ra  a. 

-  [  2  (-£*-)][  z  (-1 

j=l  h.     j=l  h. 

with  low  intercorrelations) 

a-)] 

BIQUARTIMIIv 

c 
Min  B  =  Q  +  — 

m 

Good  blend  between 

111 

Quartimin  and  Covarimin. 
Generally  provides  a  more 

Q  =  Quartirain  function 

satisfactory  simple  struc- 
ture solution  than  either 

C  =  Covarimin  function 

of  those  (in  terms  of 
interfactor  correlations 
and  factor  loadings). 

APPENDIX  (continued) 


Schene 


OBLIQUE  SCHEMES 
Obiective  Function 


Comments 


OBLIMIN 


BINORMAMIN 


Min  B*  =  a  Q  +  3-  — 

1     l   m 


or 


Min  B*  = 


Z      {m  Z    (-§*■)„ 
Kq=l       j=l  hT 


2  2 

m   a       ma. 

j-i  h.    j-i  h. 


Y  = 


1  2 


Min  B**  =   Z 


!■ 


m   9   „   9   „ 
2  (at  /hf)(a:  /hf) 

j=1  j&  j   jq  j 

[  E  (a. ,/ht)][  Z   (aZ  /hf)] 


•} 


This  is  a  general  class  of 
Biquartimin  functions. 
With  y  =  0.5  you  get 
Biquartimin  solutions, 
with  Y  =  1.0,  you  get 
Covarimin  and  if  Y  =  0, 
you  get  Quartimin. 


This  appears  best  suited 
for  data  with  either  very 
clear,  or  very  complex 
structure.   For  in-between 
data,  use  Biquartimin. 


APPENDIX  (continued) 


OBLIQUE  SCHEMES 


Scheme 

Obiective  Function 

Comments 

Oblique 

P  -  QM1/2T  D 

Commonly  used  oblique 

Varimax 

scheme.   Generally  produces 

(or  Harris- 

satisfactory  simple  struc- 

Kaiser 

where  P  is  the  ultimate  factor 

ture  solutions. 

Case  II) 

pattern  matrix,  Q  is  an 
orthonormal  matrix  of  latent 
vectors,  M  is  a  diagonal  matrix 
of  latent  roots,  T]_  is  a  square 
and  orthonormal  transformation 
matrix  based  on  criterion  such 
as  Varimax,  Equiraax,  or 
Quartimax,  and 

2 

D  -  P'P 

MAXPLANE 

Max  N 

It  is  more  efficient  on 
large  samples  and  yields 
solutions  close  to 

where  N  is  the  number  of  var- 

graphical intuition. 

iables  falling  within  a  pre- 

specified  hyperplane  width. 

PROMAX 

First  obtains  a  Varimax  solu- 

If data  dictates  orthogonal 

tion,  then  relaxes  orthogonality 

solution,  Promax  usually 

to  obtain  better  fit  to  simple 

reaches  this  solution. 

structure. 

Its  solutions  approach 
simple  structure. 
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JUN95 

,„  ..»  N.  MANCHESTER 


